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This paper concerns the exphcit treatment of substitutions in the lambda calculus. One of its 
contributions is the simplification and rationalization of the suspension calculus that embodies 
such a treatment. The earlier version of this calculus provides a cumbersome encoding of substi- 
tution composition, an operation that is important to the efficient realization of reduction. This 
encoding is simplified here, resulting in a treatment that is easy to use directly in applications. 
The rationalization consists of the elimination of a practically inconsequential flexibility in the 
unravelling of substitutions that has the inadvertent side effect of losing contextual information 
in terms; the modified calculus now has a structure that naturally supports logical analyses, such 
as ones related to the assignment of types, over lambda terms. The overall calculus is shown to 
have pleasing theoretical properties such as a strongly terminating sub-calculus for substitution 
and confiuence even in the presence of term meta variables that are accorded a grafting interpre- 
tation. Another contribution of the paper is the identification of a broad set of properties that are 
desirable for explicit substitution calculi to support and a classification of a variety of proposed 
systems based on these. The suspension calculus is used as a tool in this study. In particular, 
mappings are described between it and the other calculi towards understanding the characteristics 
of the latter. 

Categories and Subject Descriptors: F.4.3 [Mathematical Logic and Formal Languages]: 

Mathematical Logic — Lambda calculus and related systems 

General Terms: Languages, Theory 

Additional Key Words and Phrases: Lambda calculus, explicit substitutions, term rewriting, 
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1. INTRODUCTION 

This paper concerns the exphcit treatment of substitution in the lambda calculus. 
It has a twofold purpose within this context. First, it simplifies and rationalizes a 
particular calculus known as the suspension calculus that provides such a treatment 
[Nadathur and Wilson 1998]. Second, using the resulting system as a basis, it 
attempts to explicate the nuances of and differences between an array of explicit 
substitution calculi that have been proposed in recent years. 

The desire to treat substitution directly in the syntax and rewrite rules of the 
lambda calculus has had a variety of motivations. The suspension calculus was 
developed originally with the intention of supporting a higher-order view of syntax, 
now commonly referred to as higher-order abstract syntax [Pfenning and Elliott 
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1988] or lambda tree syntax [Miller 2000]. Success has been encountered in this 
endeavour: amongst other applications, the notation has been employed in the 
reasoning system called Bedwyr [Bacldc ct al. 2007], in the abstract machine for 
AProlog [Nadathur and Mitchell 1999] and in the implementation of the FLINT 
typed intermediate language [Shao et al. 1998]. Despite its use in practical sys- 
tems, the original suspension calculus manifests some deficiencies. One problem 
is the building in of excessive flexibility in the unravelling of substitutions that 
leads inadvertently to the loss of certain kinds of context information. This added 
flexibility does not really enhance the efficiency of reduction and has unpleasant 
side effects such as the loss of the ability to associate a typing calculus with lambda 
terms. Another problem relates to the encoding of the composition of substitutions. 
Although the notation includes such a capability, its treatment is complicated and 
has led to the description of a derived calculus [Nadathur 1999] that is the one 
usually employed in applications. A drawback with this derived calculus is that it 
does not possess the property of confluence when meta variables arc added to the 
syntax under the so-called grafting interpretation^. At a practical level, this has the 
impact that new approaches to higher-order unification based on using graftable 
meta variables [Dowek et al. 2000] cannot be exploited relative to it. 

One contribution of this paper is the redressing of this situation. In particular, it 
describes a modified treatment of substitution composition that is simultaneously 
natural, easy to use directly in implementations and consistent with contextual 
properties. 

The last fifteen years has seen the description of a large number of explicit substi- 
tution calculi, often without a clear enunciation of the goals underlying their design. 
A consequence of this phenomenon is that it has been difficult to evaluate the differ- 
ent calculi or even to understand the distinctive characteristics of each. This paper 
contributes in a second way by bringing greater clarity to these matters. Specif- 
ically, it identifies three properties that appear important for explicit treatments 
of substitution to support. It then surveys some of the prominent calculi in this 
realm through this prism. The suspension calculus that is developed in the earlier 
sections serves as a tool in understanding the various other systems. Through this 
process, a better grasp is also obtained of the capabilities of this specific notation. 

The rest of the paper is structured as follows. In the next section we describe 
the new version of the suspension calculus. Section 3 then elucidates its proper- 
ties: we show here the strong normalizability and confluence of the sub-calculus for 
treating substitutions and the confluence of the overall calculus even in the pres- 
ence of graftable meta variables. Section 4 discusses other treatments of explicit 
substitutions and contrasts these with the one developed here. Section 5 concludes 
the paper. 

2. THE SUSPENSION CALCULUS 

The modified version of the suspension calculus of Nadathur and Wilson [1998] that 
we present in this section does not sacrifice any of the computational properties of 
the original calculus that are essential to its use in implementations. Rather, it em- 
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bodies a view of it that is easier to reason about and to relate to other approaches to 
expUcit substitutions. In the first two subsections below, we outline the intuitions 
underlying the suspension calculus and then substantiate this discussion through 
a precise description of its syntax and reduction rules. We then discuss the rela- 
tionship of the version of the calculus we present here with the original version and 
also describe variants of it arising from the introduction of meta variables under 
two different interpretations. 

2.1 Motivating the Encoding of Substitutions 

We are interested in enhancing the syntax of the lambda calculus with a new 
category of expressions that is capable of encoding terms together with substitutions 
that have yet to be carried out on them. The kinds of substitutions that wc wish 
to treat are those that arise from beta contraction steps being applied to lambda 
terms. Towards understanding what needs to be encoded in this context, we may 
consider a term with the following structure: 

(...((A...(A...((A...t...) si). ..)...) 52)...) 

We assume here a de Bruijn representation for lambda terms, i.e., names are not 
used with abstractions and bound variable occurrences arc replaced by indices that 
count abstractions back up to the one binding them [Bruijn 1972]. We have elided 
much of the detail in the term shown and have, in fact, focussed only on the 
following aspects: there is a beta redox in it (whose "argument" part is S2) that 
is embedded possibly under abstractions and that itself contains at least another 
embedded beta redex. Contracting the two beta redexes shown should produce a 
term of the form 

(...(... (A. ..(...i'. ..)...). ..)...) 

where t' is obtained from t by substituting S2 and (a modified form of) Si for 
appropriate variables and adjusting the indices for other bound variables to account 
for the disappearance of two enclosing abstractions. Our goal is to represent t' as 
t coupled with the substitutions that are to be performed on it. 

Towards developing a suitable encoding, it is useful to factor the variable refer- 
ences within t into two groups: those that are bound by abstractions inside the first 
beta redex that is contracted and those that are bound by abstractions enclosing 
this redex. Let us refer to the number of abstractions enclosing a term in a par- 
ticular context as its embedding level relative to that context. For example, if we 
assume that every abstraction within the outer beta redex in the term considered 
above has been explicitly shown, then the embedding level of t in this context is 3. 
Rewriting a beta redex eliminates abstractions and therefore changes embedding 
levels. Thus, if the two beta redexes of interest are both contracted, the embedding 
level of t becomes 1. We shall call the embedding levels at a term before and after 
beta contractions the old and new embedding levels respectively. Simply recording 
these with a term is enough for encoding the change that needs to be made to the 
indices for variables bound by the "outer" group of abstractions; in particular, these 
indices must be decreased by the difference between the old and the new embedding 
levels. 
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Substitutions for the other group of variable references, i.e., those bound by 
abstractions within the first beta redex contracted, can be recorded expHcitly in an 
environment. To suggest a concrete syntax, the term t' in the example considered 
may be represented by the expression ft, ol,nl, ej where ol and nl are the old and 
new embedding levels, respectively, and e is the environment. Note that the number 
of entries in the environment must coincide with the old embedding level. It is 
convenient also to maintain the environment as a list or sequence of elements whose 
order is reverse that of the embedding level of the abstraction they correspond to; 
amongst other things, this allowed for an easy augmentation of the environment in a 
top-down traversal of the term. Now, one component of the entry for an abstraction 
that is contracted should obviously be the argument part of the relevant beta redex. 
For an abstraction not eliminated by a contraction, there is no new term to be 
substituted, but we can still correctly record the index corresponding to the first 
free variable as a pseudo substitution for it. In both these cases, we have also to 
pay attention to the following fact: the term in the environment may be substituted 
into a new context that has a larger number of enclosing abstractions and hence 
de Bruijn indices for free variables within it may have to be modified. To encode 
this renumbering, it suffices to record the (new) embedding level at the relevant 
abstraction with the environment entry. The difference between this and the (new) 
embedding level at the point of substitution determines the amount by which the 
free variable indices inside the term being substituted have to be changed. Thus, 
each environment entry has the form (t, I) where t is a term and / is a positive 
number. We refer to the second component of each such entry as its index and 
we observe that the indices for successive environment entries must form a non- 
increasing sequence at least for the simple form of environments we are presently 
considering. 

Once we have permitted terms encoding substitutions into our syntax, it is pos- 
sible for such terms to appear one inside another. A particular instance of this 
phenomenon is when they appear in juxtaposition as in the term 

llt,oli,nli,eij,ol2,nl2, 62] . 

This term corresponds to separately performing two sets of substitutions into t. It 
is useful to have a means for combining these into one set of substitutions, i.e., for 
rewriting the indicated term into one of the form [t, ol' , nl', e']. In determining the 
shape of the new term, it is useful to note that Ci and 62 represent substitutions for 
overlapping sequences of abstractions within which t is embedded. The generation 
of the original term can, in fact, be visualized as follows: First, a walk is made over 
oh abstractions immediately enclosing t, possibly eliminating some of them via beta 
contractions, recording substitutions for all of them in ei and eventually leaving 
behind nh enclosing abstractions. Then a similar walk is made over 0I2 abstractions 
immediately enclosing the term fti, oh, nh, ei], recording substitutions for each of 
them in 62 and leaving behind nl2 abstractions. Notice that the 0I2 abstractions 
scanned in the second walk are coextensive with some final segment of the nh 
abstractions left behind after the first walk and includes additional abstractions if 
0I2 > nh. 

Based on the image just evoked, it is not difficult to see what ol' in the term 
representing the combined form for the substitutions should be: this form represents 
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a walk over oli enclosing abstractions in the case that 0I2 < nl\ and oli + {0I2 — 
nil) abstractions otherwise and ol' should be the appropriate one of these values. 
Similarly, the number of abstractions eventually left behind is n/2 or n/2 + {nli — 0I2 ) 
depending on whether or not nh < 0I2, and this determines the value of nl'. With 
regard to the environment e', this should be composed of the elements of ei modified 
by the substitutions encoded in 62 followed by a final segment of 62 in the case that 
0I2 > nil. The modification to be effected on the elements of ei may be understood 
as follows. Suppose ei has as an element the pair {s,l). Then s is affected by only 
that part of 62 that comes after the first nli — I entries in it. Further, the index of 
the corresponding entry in the composite environment would have to be increased 
from I by an amount equal to 0I2 — nli in the case that 0I2 > nli. Prom these 
observations, it is clear that the merged environment can be generated completely 
from the components ei, nli, 0I2 and €2- We correspondingly choose to encode this 
environment by the expression -^ei, nZi, 0I2, 62 J- 

Our focus here has been on motivating the new syntactic forms in the suspension 
calculus. However, implicit in this discussion has been a "meaning" for these new 
expressions in the sense of a translation into an underlying de Bruijn term. This 
informal semantics will be made precise in the next section through a collection of 
rewrite rules that can be used to incrementally "calculate" the intended encodings. 

2.2 The Syntax of Terms and the Rewriting System 

We now describe precisely the collections of expressions that constitute terms and 
environments in the suspension calculus. We assume that the lambda terms to 
be treated contain constant symbols drawn from a predetermined set. Letting c 
represent such constants, the t and e expressions given by the following rules define 
a "pre-syntax" for our terms and environments: 

t ::= c\#i \ {tt) I (At) I p,n,n,el 
e ::= nil \ {{t,n) :: e) | ^e,n,n,e^ 

In these rules, n corresponds to the category of natural numbers and i represents 
positive integers. Terms of the form {ti 12) and {\t) are, as usual, referred to 
as applications and abstractions. A term of the form #z, known as a de Bruijn 
index, represents a variable bound by the zth abstraction looking outward from the 
point of its occurrence. Expressions of the form \t, ol, nl, e] are called suspensions; 
these constitute a genuine extension to the syntax of lambda terms. The operator 
:: provides the means for forming lists in environments. We use the conventions 
that application is left associative, that :: is right associative and that application 
binds more tightly than abstraction to often omit parentheses in the expressions 
we write. We shall sometimes need to suppress the distinction between terms and 
environments and at these times we shall refer to them collectively as suspension 
expressions or, more simply, as expressions. 

The reason we think of the rules above as defining only the pre-syntax is that we 
expect suspension expressions to also satisfy certain well-formedness constraints. 
In order to enunciate these constraints precisely, we need to associate the notions 
of length and level with environments. We do this through the following defini- 
tions. The symbol - used in these definitions denotes the subtraction operation 
on natural numbers. 
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Definition 2.1. The length of an environment e is denoted by len{e) and is de- 
fined by recursion on its structure as follows: 

(1) len{nil) = 

(2) len{{t,l) :: e) = l + len{e) 

(3) len{\ei,nh,ol2,e2}) = Zen(ei) + (Zen(e2) - nh) 

Definition 2.2. The level of an environment e, denoted by lev{e), is also given 

by recursion as follows: 

(1) lev{nil) = 

(2) lev{{t,l) ■.■.e) = l 

(3) lev{^ei, nh, oh, 62 J) = lev{e2) + {nh - oh) 

The legitimacy requirements that complement the syntax rules is now explicated 
as follows: 

Definition 2.3. A suspension expression is considered well-formed just in case 
the following conditions hold of all its subexpressions: 

(1) If it is of the form |i, ol, nl, e] then len{e) = ol and lev{e) < nl. 

(2) If it is of the form [t, I) :: e then I > lev{e). 

(3) If it is of the form ^ei,nh,oh,e2^ then lev{ei) < nh and len{e2) = oh- 

We henceforth consider only well-formed suspension expressions. We shall also 
sometimes restrict our attention to environments which have the structure of a list 
of bindings. We identify this class of environments below. 

Definition 2.4. A simple environment is one of the form 
(to, ^o) ih,h) ::...:: (tn-ijn-i) ■■ nil 

where by an abuse of notation, we allow n to be 0, in which case the environment 
in question is nil. For < i < n, we write e[i] to denote the environment element 
(ti,li) and e{i} to denote (ti,li) :: ... :: {tn~i,ln-i) ■'■ nil, i.e., the environment 
obtained from e by removing its first i elements. We extend the last notation by 
letting e{i} denote nil in the case that i > len{e) for any simple environment e. 

The rewrite system associated with suspension expressions comprises three kinds 
of rules: the beta contraction rule that generates substitutions, the reading rules 
that distribute them over term structure and the merging rules that allow for the 
combination of substitutions generated by different beta contractions into a com- 
posite one. These three categories correspond to the rules in Figure 1 labelled (/3s)) 
(rl)-(r6) and (ml)-(m6), respectively. The application of several of these rules de- 
pends on arithmetic calculations on embedding levels and indices. We have been 
carehil in the formal presentation to identify such calculations through side condi- 
tions on the rules. However, in the sequel, we will often assimilate such arithmetic 
operations into the rewrite rule itself with the understanding that they are to be 
"interpreted." Using this approach, rule (r6) may have been written instead as 

l{Xt), ol, nl, el ^ (A ft, ol + l,nl + 1, (#1, nl + 1) :: el). 
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(A) ((Ati) t2) ^ [ti,l,0,(t2,0) ■.■.nil}. 

(rl) |c, oi, ni, e] — > c, provided c is a constant. 

(r2) 0, n(, nil} —* #j , where j = i + nl. 

(r3) [#l,o«,n«, (t,0 :: e] |t, 0, nZ', nii], where nZ' = nl-l. 

(r4) [#i,oZ,nZ,(t,0 :: e] ^ oi', n/, e], 

where i' = i — 1 and oi' = o( — 1, provided j > 1. 

(r5) |(ti t2),oi,ni,e] — > ([ti,oi,ni,e] [t2, oi, nJ, e]). 

(r6) [(At),oZ,ni,e] (A [t, o«', ni' , (#1, nZ') :: e]), 
where ol' = ol + 1 and nZ' = + 1. 

(ml) llt,oli,nli,eij,ol2,nl2,e2j |t, oZ', nZ', {ei, nZi, 0^2, 62}], 
where ol' = oli + {0I2 - nh) and nl' = nl2 + {nh - 0I2). 

(m2) -^^ei, rail, 0, nii J ^ ei. 

(m3) fraii, 0, 0^2, £2} — > £2. 

(m4) -g^niZ, nh, 0I2, (t, I) 62} — > -SniZ, nZj, oZj, 62 J, 

where nZ^ = nZi — 1 and oZj = 0Z2 — 1, provided nli > 1. 

(m5) fit,n) :: ei,nli,ol2, {s,l) :: 62} {(*,") ei,ril[, ol!^, €2^} , 
where nZ^ = nZi — 1 and oZj = 0Z2 — 1, provided nZi > ra. 

(m6) {(t, n) :: ei,n,ol2, (s, Z) :: 62} — *■ ([t, oZ2,Z, (s,Z) :: 62], m) :: |ei,n,oZ2, (s, Z) :: 62}, 
where m = Z + (n - oZ2). 

Fig. 1. Rewrite Rules for the Suspension Calculus 

Definition 2.5. Wc say that a suspension expression r is related to ,s by a /3s- 
contraction step, a reading step or a merging step if it is the result of applying the 
(/3s) rule, one of the rules (rl)-(r6) or one of the rules (ml)-(m6), respectively, at 
any relevant subexpression of s. We denote these relations by writing sO^^r, st>r'>' 
and sC>ni?'j respectively. The union of the relations C>r and [>„ will be denoted by 
\>rm, that of >r and by l>r0^ and, finally, that of all three relations by l>rm/3,- 
If R corresponds to any of these relations, we shall write R* to denote its reflexive 
and transitive closure. 

The following theorem shows that these various relations are well-defined. 

Theorem 2.6. The relations [>3^, Cv and \>rn, o-nd, hence, any combination of 
them, preserve well-formedness of suspension expressions. 

Proof. A somewhat stronger property can be proved for the rewriting relations 
of interest: (i) they leave the length of an (environment unchanged, (ii) they never 
increase the level of an environment, and (iii) they preserve well-formedness. These 
facts are established simultaneously by induction on the structure of suspension 
expressions. The base case is verified by considering in turn each rewrite rule in 
Figure 1. The argument is then completed by considering each possibility for the 
structure of an expression and using the induction hypothesis. The details are 
entirely straightforward and hence omitted. □ 

We illustrate the rewrite rules by considering their use on the term 
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((A(AA#1 #2 #3) t2)t3), 

where t2 and ^3 are arbitrary terms. We trace a >rm,/3s "Rewrite sequence for this 
term below: 

((A(AA#1 #2 #3) t2) ts) 

>ftI[A#l #2 #3,l,0,(t2,0) ::™;],l,0,(t3,0) ■.: nilj 

>„[A#1 #2 #3,2,0, i{h,0) ■.■.ml,0,l,{t3,0) ::nil}l 

>mlX #1 #2 #3, 2, 0, ([i2, 1, 0, (is, 0) :: mZ], 0) :: fnil, 0, 1, (is, 0) :: nil}} 

C>„IA#1 #2 #3,2,0,(p2,l,0,(is,0) :: mZl,0) :: (is,0) ■.■.nil}. 

The last expression in this sequence is a term that represents, roughly, the "sus- 
pended" simultaneous substitution of ^2, modified by the substitution of is for its 
first free variable, and of is for the first two free variables in (A#l #2 #3). This 
suspension has been produced by contracting the two beta redoxes in the original 
term and then using the merging rules to combine the two separate substitutions 
that are so generated. The combined environment can now be moved inside the 
abstraction, distributed over the applications and partially "evaluated" using the 
reading rules to yield 

(A#l [[i2,l,0,(i3,0) ::m/l,0.1,m/l) lh,0,l,mlj)). 

This term manifests a structure that may be thought of as a generalization of 
head- normal forms to suspension terms. By applying reading and merging rules 
in accordance with the structure of i2 and is, we may further transform it into a 
head-normal form in the conventional sense. 

The terms in the de Bruijn style presentation of the lambda calculus are a subset 
of the terms in the suspension calculus. In particular, they are exactly the terms in 
the present notation that do not contain any suspensions. Given a rewrite relation 
R, we shall say, as usual, that an expression is in i?-normal form if it cannot be 
further transformed by the rules defining R. It is easily seen then that a suspension 
term is in de Bruijn form just in case it is in [>rm-normal form. We would, of course, 
be interested in knowing if any given suspension expression can be transformed into 
a normal form of this kind. Wo answer this question in the affirmative in the next 
section and subsequently relate the rewrite relations defined here with the usual 
notion of beta reduction over de Bruijn terms. 

2.3 Relationship to the Original Suspension Calculus 

The suspension calculus as we have described it here deviates from the original pre- 
sentation in [Nadathur and Wilson 1998] in a few different ways. One distinction 
arises from the use in the earlier version of the calculus of a special form for the 
environment item that results from percolating a substitution under an abstrac- 
tion. These items arc written as @n where n is a natural number. The rule (r6) 
correspondingly has the form 

|(Ai),oZ,n/,e] ^ (A |i, 0/ -h 1, nl + 1, @n/ :: ej) 

in that setting. This form was introduced into the syntax and treated in special 
ways by the rewrite rules in anticipation of an implementation optimization. It is, 
however, inessential at a theoretical level. In particular, the behaviour of a dummy 
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environment element of the form @n can be completely circumscribed by replacing 
it with (#1, n + 1)^. We assume the impact of this observation below. 

Suspension expressions in the present setting constitute a subset of the expres- 
sions in the original calculus at a pre-syntax level. However, the well-formedness 
condition when restricted to these expressions is different in the two contexts. The 
earlier condition has a form that is identical to the one in Definition 2.3 except that 
the requirement on the levels of environments is replaced by one on their indices, 
a notion that is defined below. 

Definition 2.7. Given a natural number i, the i-th index of an environment e is 
denoted by indi{e) and is defined as follows: 

(1) If e is nil then ind,i{e) = 0. 

(2) If e is (t, k) :: e' then indi(e) is /c if i = and indi-i{e') otherwise. 

(3) If e is -§^ei, nl, ol, e2^, let m — {nl - indi{ei))^ and I = len{ei). Then 



The index of an environment, denoted by ind{e), is ind('i{e). 

Any given environment expression e is expected to be reducible to a simple one of 
the form (to.^o) • • • " (^n-i.^n-i) " nil. The i-th index of e is then precisely /, 
if i < n and otherwise. The level of e. in contrast, only estimates the 0-th index 
when e is reduced to this simple form while retaining information that is needed 
for interpreting intermediate expressions in the rewriting process. Nevertheless, we 
can observe the following: 

Lemma 2.8. The well-formed expressions of the suspension calculus as described 
in this paper are a subset of the well-formed ones of the original presentation. 

Proof. We prove the following by induction on the structure of a suspension 
expression that is well- formed under the criterion in this paper: (a) the expression 
is also well-formed under the earlier criterion and (b) if the expression is an environ- 
ment e, then lev{e) > ind{e) and ii i > j then indi{e) > indj{e). These properties 
must be shown simultaneously: the induction hypothesis pertaining to (b) is needed 
for establishing (a) and we need to know that the expression is well-formed in the 
earlier sense in order to establish (b). The details are straightforward once these 
observations are made and hence we omit them here. The lemma is an immediate 
consequence of property (a). □ 

The final difference between the two versions of the suspension calculus is in the 
treatment of the composition of two environments. In the earlier presentation, the 
outer environment is distributed eagerly over the elements of the inner one. This 
is done by a rule of the form 



^It should be noted, though, that the parsimony of the latter form is complemented by the 
introduction of more (perhaps unnecessary) possibilities for rewriting that considerably complicate 
the proof of termination for the reading and merging rules. 
^The - here can be replaced by — for well-formed expressions. 
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fet :: ei,nl,ol,e2j {{et , nl , ol , 62)) :: fei,nl,ol,e2j, 

where {{et,nl, ol, 62)) represents an augmentation to the syntax of environment 
items for encoding the effect of transforming et by the relevant substitutions in 
62- The older version of the calculus has rules relating to expressions of the form 
{{et, nl,ol, €2)) that facilitate the pruning of €2 down to a part that really affects et 
and the subsequent generation of a suspension that captures its influence on the 
term component. By contrast, the present rendition of the calculus calculates the 
effect of 62 on :: ei by first pruning 62 down to a relevant part based on et and 
only later distributing the refined environment to e\. 

It follows naturally from the observations made above that the rules (m2), (m5) 
and (m6) do not appear in the original rendition of the suspension calculus. How- 
ever, based on the discussions already in [Nadathur and Wilson 1998], it can be 
seen that each of these rules is admissible to the earlier version in the sense that 
their left and right hand sides can be rewritten to a common form in that setting. 
We can, in fact, make the following observation, a detailed proof of which appears 
in [Gacek 2006b]: 

Lemma 2.9. Let xi and X2 be suspension expressions such that Xi\>*^X2. As- 
sume further that X2 is in l>7-yy^-7ior?7^tt/ form. Then x\ also rewrites to X2 by virtue 
of the reading and merging rules in [Nadathur and Wilson 1998]. 

Our focus up to this point has been on arguing that the suspension calculus as 
described here is a subsystem of sorts of the original presentation. It is important, 
of course, to also address the issue of why such a "subsystem" is of interest. There 
are several reasons for this, all arising out of the modified treatment of substitution 
composition. First, this treatment is a considerably simplified one and can, as 
a consequence, be used directly in practical applications. Second, it rectifies a 
problem with the original calculus that prevented certain interesting logical analyses 
over terms from being formulated: it is, for instance possible to describe a type 
assignment system now for terms [Gacek 2006b] , something that was difficult to do 
with the original suspension calculus. Finally, this change is crucial to our ability 
to describe formal correspondences of the suspension calculus with other explicit 
substitution calculi later in this paper. 

While there may be justifications for the modified suspension calculus, there is 
also a question about its adequacy. It is evident that this version can still treat 
substitutions explicitly and that it possesses the important capability of composing 
such substitutions. In the next section we see also that properties such as confluence 
and the ability to simulate the usual notion of beta reduction over lambda terms 
are preserved, thus settling any concern over adequacy. 

2.4 Permitting Meta Variables In Suspension Terms 

The syntax of suspension expressions does not presently allow for instantiatable 
variables. Such variables, also referred to as meta variables, are often used within 
lambda terms in situations such as those of higher-order theorem proving and 
symbolic manipulation of higher-order objects. In the former context, these vari- 
ables arise naturally in attempts to prove existential statements: such proofs in- 
volve choosing instantiations for existential quantifiers and meta variables provide 
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a means for delaying actual choices till there is enough information for determining 
what they should be. In the latter context, instantiatable variables are instrumen- 
tal in realizing structure recognition capabilities relative to the use of higher-order 
abstract syntax based representations of constructs whose structures involve bind- 
ing notions. For example, consider the first-order formula Va;((p x)\/ {q x)). Using 
an abstraction to capture the binding content of the quantifier, this formula can 
be rendered into the lambda term {all A (or {p #1) {q #1))), where all and or are 
constants chosen to encode universal quantification and disjunction in formulas. 
Given such representations, the lambda term {all A (or {P jj^l) {Q #1))) in which 
P and Q are meta variables serves as a pattern for recognizing formulas that at the 
top-level have the structure of a disjunction embedded within a universal quantifier. 

An important question concerning meta variables is that of how substitutions 

for them are to be treated. The logically correct interpretation of these variables 
requires that such substitutions respect the notion of scope. Thus, if X is an 
instantiatable variable that has an occurrence within an abstraction context, the 
term that is substituted for it cannot contain a bound variable that is captured by 
the enclosing abstraction. This view is one that also supports rather useful pattern 
matching capabilities. To understand this, we might reconsider the "template" 
we have described above for first-order formulas. Suppose that we want to refine 
this so that the formulas recognized by it arc such that the right subpart of the 
disjunction does not depend on the top-level quantifier. If a treatment of meta 
variables in accordance with logical principles is used, then the following modified 
template achieves this purpose: {all \ {or {P #1) Q))). The critical facet that 
ensures this behaviour is that no structure that is substituted for Q can have a 
variable occurrence in it that is captured by the abstraction corresponding to the 
quantifier. 

An alternative possibility to the logical view of instantiatable variables is to treat 
them as placeholders against which any well-formed term can be grafted. This 
kind of "grafting" interpretation has been found useful in conjunction with explicit 
substitution notations in, for instance, realizing a new approach to unification in 
the context of lambda terms [Dowek et al. 2000] . The well-known procedure due 
to Huet [1975] calculates unifiers incrementally and requires the construction of 
a complicated term, the contraction of beta redexes and the calculation of their 
substitution effects all for the sole purpose of percolating dependency information 
to places where they can be used in later computation steps. By allowing meta 
variables to be substituted for by terms with variable occurrences that can be 
captured by enclosing abstractions, the dependencies can be transmitted by a much 
simpler process. Of course, treating instantiatable variables in this "graftable" way 
seems contradictory to their logical interpretation and also appears to fly in the face 
of pattern matching applications. However, a reconciliation is possible: variables 
can be interpreted initially in a logical way but then surrounded in an explicit 
substitution context so that a subsequent grafting treatment does not violate the 
required logical constraints. Thus, consider again the term {all \ {or [P jf^l) Q)). 
This term may be transformed into {all A (or {[P' , 0, 1, nil] #1) {Q' , 0, 1, nilf)). By 
identifying P and Q with the terms {P' ,0,1, nil} and [Q' ,0,1, nil}, we insulate 
substitutions for them from a dependence on the external abstraction even under 
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a grafting interpretation of P' and Q' . 

Either of the discussed views of meta variables can be built into the suspension 
notation. Towards this end, we first modify the syntax for terms to the following: 

t ::= v\c\m\{tt)\ (Ai) | Ii,n,n,e|, 

where v represents the category of instantiatable variables. If we interpret these 
variables in the logical way, then they cannot be affected by substitutions generated 
by /3-contractions. To support this view, therefore, we add the following to our 
reading rules: 

(r7) ol, nl, e] —^v/iivisa. meta variable. 

If, on the other hand, the grafting interpretation is chosen, then this rule is not 
acceptable and the original rewriting system, in fact, remains unchanged. 

The choice of interpretation impact on the properties of the calculus in different 
ways. Under the logical view, meta variables behave like constants in that they may 
be replaced only by closed terms; this fact is explicitly manifest in the similarity 
of rule (r7) to (rl). Thus, all the properties of the calculus that includes them are 
already manifest in the subsystem described in Section 2.2. The situation is more 
intricate under the grafting view. For example, consider the term ((A ((AX) t\)) ^2) 
in which X is an instantiatable variable and ti and t2 are terms in [>rm-normal form. 
This term can be rewritten to 

I|X,l,0,(ti,0)::mZl,l,0,(t2,0)::mZl 

and also to 

2, 1, (#1, 1) :: (t2, 0) :: nil}, 1, 0, (pi, 1, 0, (^2, 0) :: m^], 0) :: ml], 

amongst other terms. It is easy to see that these terms cannot now be rewritten 
to a common form using only the reading and {(3^) rules. The merging rules are 
essential to this ability. As we sec in Section 3, those also suffice for this purpose. 

We assume henceforth that the suspension calculus includes meta variables and 
that these are implicitly accorded the grafting interpretation. For reasons already 
mentioned, it is easy to sec that the properties we establish for the resulting calculus 
will hold also under the logical interpretation. 

3. PROPERTIES OF THE SUSPENSION CALCULUS 

We now consider the coherence of the suspension calculus. Suspensions and the 
associated reading and merging rules are intended mainly to provide control and 
variability over substitution relative to the lambda calculus. In keeping with the 
finite nature of the substitution process, we would expect the reduction relations 
defined by these rules to be always terminating. We show this to be the case in the 
first subsection. There are evidently choices to bo made in the application of the 
reading and merging rules. Regardless of how these choices are made, it is important 
that we produce the same normal form. We show that this confluence property holds 
in the second subsection below. We then digress briefly to establish an interesting 
structural property of the suspension calculus which relates two different methods 
for encoding the renumbering of bound variables; this property is used in the next 
section in relating the suspension calculus to the Aa-calculus. Finally, we prove 
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that confluence continues to hold when the {(3g) rule is added to the collection and 
that this full system is also capable of simulating beta reduction over de Bruijn 
terms. 

3.1 Strong Normalizablllty for Substitution Reductions 

There arc two steps to our argiimcnt that any sequence of rcwritings based on 
the reading and merging rules must terminate. First we identify a collection of 
first-order terms over which we define a well-founded ordering using a variant of 
recursive path ordcrings [Dcrshowitz 1982; Fcrrcira and Zantcma 1995]. Wc then 
describe a translation from suspension expressions to this collection of terms that 
is such that each of the relevant rewrite rules produces a smaller term relative to 
the defined order. The desired conclusion follows from these facts. 

The terms that are intended to capture the essence of suspension expressions 
vis-a-vis termination are constructed using the following (infinite) vocabulary: the 
0-ary function symbol * the unary function symbol lam, and the binary function 
symbols app, cons and, for each positive number i, Si. We denote this collection of 
terms by T. We assume the following partial ordering □ on the signature underlying 
T: Si □ Sj if i > j and, for every z, Sj □ app, Sj □ lam, Si □ cons and Sj □ *. This 
ordering is now extended to the collection of terms. 

Definition 3.1. The relation )~ on T is inductively defined by the following prop- 
erty: Let s = /(si, . . . , Sm) and t = g{ti, . . . , both s and t may be *, i.e., the 
number of arguments for either term may be 0. Then s y t ii 

(1) f = g (in which case n = m), (si, . . . , s„) yiex {ti, ■ ■ ■ , tn), and, s y U for all i 
such that 1 < i < n, or 

(2) / □ 5 and s y ti for all i such that 1 < i < n, or 

(3) Si = t or Si y t for some i such that 1 < i <m. 

Here yiex denotes the lexicographic ordering induced by >-. 

In the terminology of [Ferreira and Zantema 1995], >- is an instance of a recursive 
path ordering based on It is easily seen that □ is a well-founded ordering on 
the signature underlying T. The results in [Ferreira and Zantema 1995] then imply 
the following: 

Lemma 3.2. y is a well-founded partial order on T. 

We now consider the translation from suspension expressions to T. The critical 
part of this mapping is the treatment of expressions of the form lt,ol,nl,e] and 
^ei,nl,ol,e2^. Our translation ignores the embedding level components of these 
expressions and transforms them into terms whose top-level function symbol is Sj 
where i is a coarse measure of the remaining substitution work. In estimating this 
effort in a sufficiently fine-grained way relative to an abstraction, it is necessary to 
take cognizance of the following fact: rule (r6) creates a "dummy" substitution for 
the bound variable that is then adjusted by generating a "renumbering" suspension 
using rule (r3). To account for this additional work, we define a family of measures 
that relativizes the complexity of an expression to the number of enclosing suspen- 
sions. In calculating this quantity it is important to observe that the substitution 
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via rule (r3) of a term in an environment results in it being embedded in an ad- 
ditional suspension. We quantify the maximum such "internal embedding" below 

and then use this in estimating the substitution effort. In these definitions, max is 
the function that picks the larger of its two integer arguments. 

Definition 3.3. The measure jj that estimates the internal embedding potential 
of a suspension expression is defined as follows: 

(1) For a term t, ii{t) is if t is a constant, a meta variable or a de Bruijn index, 
/i(s) if t is (As), max{iJ,{si), iJ.{s2)) if t is (si S2), and fj.{s) + jiie) + 1 if i is 
|s, ol, nl, ej. 

(2) For an environment e, /i(e) is if e is nil, max{fi{s), At(ei)) if e is (s, I) :: ei and 
/i(ei) -|-/u(e2) + 1 if e is {^ei,nl,ol,e2j- 

Definition 3.4. The measures rji on terms and environments for each natural 
number i arc defined simultaneously by recursion as follows: 

(1) For a term t, rji{t) is 1 if i is a constant, a meta variable or a de Bruijn 
index, ryi(s) + 1 if t is (As), max{r]i{si),r]i{s2)) + 1 if t is {s\ S2),and ?7i+i(s) + 
Vi+i+t^(s) {P-) + 1 if t is |s,o/,n/,e]. 

(2) For an environment e, rii{e) is if e is nil, max{r]i{s),r]i{ei)) if e is {s,l) :: ei 
and 77i+i(ei) -|-?7j+i+^(ei)(e2) + 1 if e is lei,nl, 01,62}. 

The measure rio is meaningfully used only relative to suspensions. In this context, 
it estimates, in a sense, the maximum effort along any one path in the substitution 

process rather than the cmnulativc effort. 

Definition 3.5. The translation £ of suspension expressions to T is defined as 
follows: 

(1) For a term t, £{t) is * if f is a constant a meta variable or a de Bruijn index, 
app{£{ti),£{t2)) if Ms {ti tj), lam{£(t')) litis {Xt') and Si{£{t'),£{e')) where 
i = r]o{t) if t is ol, nl, e'J. 

(2) For an environment e, £{e) is * if e is nil, cons{£{t'),£{e')) if e is {t' ,1) :: e' 
and Si{£{ei),£{e2)) where i = r/o(e) if e is ^ei,nl, 01,62}- 

We are now in a position to prove the strong normalizability of the substitution 

reduction relations. 

Theorem 3.6. Every rewriting sequence based on the reading and merging rules 
terminates. 

Proof. A tedious but straightforward inspection of each of the reading and 
merging rules verifies the following: If Z — > r is an instance of these rules, then £{1) >- 
£{r), ijl{1) > fi{r), and, for every natural number i, r]i{l) > r]i{r). Definition 3.1 
ensures that >- is monotonic, i.e., if v results from u by the replacement of a subpart 
x hy y such that x )^ y, then u )^ v. Further, it is easily seen that if x and y 
are both either terms or environments such that ^(x) > /x(j/) and r]i{x) > r]i{y) 
for each natural number i and if v is obtained from u by substituting y for x, 
then r]i{u) > rji{v) for each natural number i. From these observations it follows 
easily that if ti>rm^2 then £{ti) >- £(t2)- The theorem is now a consequence of 
Lemma 3.2. □ 
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As an interesting side note, we observe that the termination proof presented here 
has been formally verified using the Coq proof assistant [Gacek 2006a]. 

3.2 Confluence for the Substitution Calculus 

Theorem 3.6 assures us that every suspension expression has a >rm-normal form. 

From observations in Section 2 it follows therefore that every snspcnsion term can 
be reduced to a de Bruijn term and every environment can be rewritten to one in 
a simple form using the reading and merging rules. We now desire to show that 
these normal forms arc unique for any given expression. This would immediately 
be the case if we have the property of confluence, i.e., if for any s, u and v such 
that s>*„'U and s>*^v we know that there must be a f such that ut>*^t and v\^^t. 
A well-known result, proved, for instance, in [Huet 1980], is that confluence follows 
from a weaker property known as local confluence for a reduction relation that is 
terminating. In our context this translates to it being sufficient to show for any 
suspension expression s that if s>rmU and st>rmV then there must be an expression 
t such that u\>*j^t and v\>*^t. The usual method for proving local confluence for 
a rewrite system is to consider the different interfering ways in which pair of rules 
can be applied to a given term and to show that a common term can be produced 
in each of these cases. We use this approach in proving local confluence for the 
reading and merging rules here. The most involved part of the argument concerns 
the interference of rule (ml) with itself. We discuss this situation first and then 
use our analysis in proving the main result. 

3.2.1 An associativity property for environment composition. The expression 
lilt, oh, nil, eij, 0I2, nh, 62], 0^3, nlz, esj can be transformed into a form correspond- 
ing to the term t under a substitiition represented by a single environment in two 
different ways by using rule (ml). The composite environments in the two cases 
are given by the expressions 

•g^l^ei, nh, oh, 62 J, nh + {nh - oh), oh, es} 

and 

lei,nh,oh + {oh -nh),ie2,nh,oh,e3}J. 

Conceptually, these environments correspond to first composing ei and 62 and then 
composing the result with 63 or, alternatively, to composing ei with the result of 
composing 62 with 63. An important requirement for local confluence is that these 
two environments can be made to converge to a common form, i.e., environment 
composition must, in a sense, be associative. Wc show this to be the case here. 
The argument we provide is inductive on the structures of the three environments 
and has the following broad outline: Based on the specific context, we consider 
the simplification of one; of the two environments by relevant reading and merging 
rules. We then show that the other expression can also be rewritten, possibly by 
using the same rules, either to the same expression as the first or to an expression 
that is amenable to the use of the induction hypothesis. 

We begin by noting some properties of the reading and merging rules that are 
useful in filling out the details of the proof. The first of these relates to the second 
environment displayed above and has the following content: At some point in the 
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reduction of this expression, it becomes possible to apply the rules relevant to 
evaluating the composition of 62 and 63. Applying these rules immediately does 
not limit the normal forms that can be produced. This observation is contained in 
the next two lemmas. 

Lemma 3.7. Let A be the environment ^ei,nli,oli, ^62,7112,01^,63^^ where 63 
is a simple environment and 62 is of the form {t2,n2) " e'2. Further, for any positive 
number i such that i < n/2 — n2 and i < 0I3, let B be the environment 

lei,nh,oli, f 62, n/2 - i, ok - i, esii}}}- 

If A[>*^C for any simple environment C then also B\>*^C. 

Proof. It suffices to verify the claim when i = 1; an easy induction on i then 
extends the result to the cases where i > 1. For the case of « = 1, the argument is 
by induction on the length of the reduction sequence from A to C with the essential 
part being a consideration of the first rule used. The details are straightforward 
and hence omitted. □ 

Lemma 3.8. Let A be the environment \^ei,nli,oli,\e2,nl2, 01^,63^^ where €2 
and 63 are environments of the form {t2,nl2) ■■ e'2 and (tsjns) :: 63, respectively. 
Further, let B be the environment 

^ei,nli,oli, (1^2, 0/3, n3, 631,713 + {nh-oh)) :: \e'^,nl2,oh,ez}^ . 

If A:>*^C for any simple environment C then also B\>*^^C. 

Proof. The proof is again by induction on the length of the reduction sequence 
from A to C. The first rule in this sequence either produces B, in which case 
the lemma follows immediately, or it can be used on B (perhaps at more than 
one place) to produce a form that is amenable to the application of the induction 
hypothesis. □ 

In evaluating the composition of 62 and 63, it may be the case that some part 
of 63 is inconsequential. The last observation that we need is that this part can be 
"pruned" immediately in calculating the composition of the combination of 61 and 
62 with 63. The following lemma is consequential in establishing this fact. 

Lemma 3.9. Let A be the environment -§^61, n/i, 0/2, 62 J where 62 is a simple 
environment. 

{!) If 0I2 < nil — lev{ei) then A reduces to any simple environment that ei reduces 
to. 

(2) For any positive number i such that i <nli — lev{ei) and i < 0I2, A reduces to 
any simple environment that -^61, nli — i, 0I2 — i, e2{i} J reduces to. 

Proof. Let ei be reducible to the simple cnvironmcnit e[. Then we may trans- 
form A to the form ^e[,nli, 0I2, 62 J. Recalling that the level of an environment is 
never increased by rewriting, we have that lev{e'i) < lev{ei). From this it follows 
that A can be rewritten to e[ using rules (m5) and (m2) if 0I2 < nli — lev{ei). This 
establishes the first part of the lemma. 

The second part is nontrivial only if nli — lev{e\) and 0I2 are both nonzero. 
Suppose this to be the case and let B be -8^61,71/1 — 1,0/2 — l)e2{l}J. The desired 
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result follows by an induction on i if we can show that A can be rewritten to any 
simple environment that B reduces to. We do this by an induction on the length 
of the reduction sequence from B to the simple environment. This sequence must 
evidently be of length at least one. If a proper subpart of B is rewritten by the 
first rule in this sequence, then the same rule can be applied to A as well and the 
induction hypothesis easily yields the desired conclusion. If B is rewritten by one 
of the rules (m3)-(m6), then it must be the case that Al>rmB via either rule (m4) or 
(m5) from which the claim follows immediately. Finally, if B is rewritten using rule 
(m2), then 0I2 < nli — lev{ei). The second part of the lemma is now a consequence 
of the first part. □ 

We now prove the associativity property for environment composition: 

Lemma 3.10. Let A and B be environments of the form 

ilei,nli,ol2,e2},nl2 + {nh - 0I2), 013,63} 

and 

lei,nli,ol2 + {0I3 - nh), {,62, nh, 013,63}}, 

respectively. Then there is a simple environment C such that A>*^C and Bt>*^C. 

Proof. We assume that ei, 62 and 63 are simple environments; if this is not the 
case at the outset, then we may rewrite them to such a form in both A and B before 
commencing the proof we provide. Our argument is now based on an induction on 
the structure of 63 with possibly further inductions on the structures of 62 and e\. 

Base case for first induction. When 63 is nil, the lemma is seen to be true by 
observing that both A and B rewrite to \^6i,nl\, oh, 62} by virtue of rule (m2). 

Inductive step for first induction. Let 63 = {t3,n3) :: 63. We now proceed by an 
induction on the structure of 62- 

Base case for second induction. When 62 is nil, it can be seen that, by virtue of rules 
(m2), (m3) and either (m4) or (m5), A and B reduce to §ei, nli, 0I3 — nh, 63{nl2}} 
when 0I3 > nh and to ei otherwise. The truth of the lemma follows immediately 
from this. 

Inductive step for second induction. Let 62 = {t2,n2) 62- We consider first the 
situation where nli > lev{6i). Suppose further that 0I3 < {nh — n2). Using rules 
(m5) and (m2), we sec then that 

B[>*.^fei, nh, oh, 62}. 

We also note that 0I3 < {nh + {nh - oh)) — l6v{^6i, nh, oh, 62}) in this case. 
Lemma 3.9 assures us now that A can be rewritten to any simple environment that 
^61, nh, 0I2, 62} reduces to and thereby verifies the lemma in this case. 
It is possible, of course, that 0I3 > {nh — n2). Here we see that 

B^rmi^i^nh - I, oh + {0I3 - nh) - 1, 

1^62, n2, 0I3 - {nh - n2), e3{nh - n2}}}. 

using rules (m5) and (m6). Using rule (m5), we also have that 
^>*mi-5ei, nh - 1, oh - 1, 62}, nh + {nh - oh), 0I3, 63}. 
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Invoking the induction hypothesis, it fohows that A and 

{{ei, nil - 1, 0I2 + {oh - nh) - 1, {{63, nl2,ol3, 63!}} 

reduce to a common simple environment. By Lemma 3.7 it follows that B must 
also reduce to this environment. 

The only remaining situation to consider, then, is that when nli = lev{ei). For 
this case we need the last induction, that on the structure of ei. 

Base case for final induction. If ei is nil, then nli must be 0. It follows easily that 
both A and B reduce to -8^62, nZ2, 0/3, 63 J and that the lemma must therefore be 
true. 

Inductive step for final induction. Here ei must be of the form {ti,nli) :: e'^. We 
dispense first with the situation where n2 < nh. In this case, by rule (m5) 

B>*^iei,nli,ol2 + {ol3-nl2),ie2,nl2 - 1,0/3 - 1,63^}. 

By the induction hypothesis used relative to e'^, B and the expression 

i^ie-i,nh,ol2,e2'},nl2 + {nh-oh) - I, oh - IjGsS^ 

must reduce to a common simple environment. By Lemma 3.9, A must also reduce 
to this environment. 

Thus, it only remains for us to consider the situation in which n2 = nh- In this 
case by using rule (ml) twice we may transform A to the expression A/j :: A^ where 

Ah = (11^1,0/2,^2, 62!, 0/3,713, 63], n3 + ((nZ2 + (nh - oh)) - oh)) 

and 

At = tie'^,nli,ol2,e2},nl2 + {nh - 0/2), 0Z3, 63^. 

Similarly, B may be rewritten to the expression Bh :: Bf where 

Bh = {lti,oh + {oh-nh),n3 + {nl2-oh), 

(1*2, 0/3, 713, 63], n3 + (71/2-0/3)) :: Se'a, n/2, 0/3, 63 Jl, 
ns + {nh - oh) + {nh - {oh + {oh - nh)))) 

and 

Bt = fe'i, nli,oh + {oh - nh), 

(p2, 0/3, ns, 63], n3 + {nh - oh)) :: -8^62, nh, oh, es}}- 

Now, using straightforward arithmetic identities, it can be seen that the "index" 
components of Ah and Bh are equal. Further, the term component of Ah can be 
rewritten to a form identical to the term component of Bh by using the rules (ml) 
and (m6). Finally, by virtue of the induction hypothesis, it follows that At and the 
expression 

^e[, n/i, 0/2 + (0/3 - nh), i_e2, nh, oh, 63 

reduce to a common simple environment. Lemma 3.8 allows us to conclude that Bi 
can also be rewritten to this expression. Putting all these observations together it 
is seen that A and B can be reduced to a common simple environment in this case 
as well. □ 
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3.2.2 Uniqueness of Substitution Normal Forms. We can now show that t>rm is 
a locally confluent reduction relation. 

Lemma 3.11. For any expressions s, u and v such that s>rmU and s>rmV there 
must be an expression t such that ut>*j^t and v>*j^t. 

Proof. We recall the method of proof from [Huet 1980]. An expression t con- 
stitutes a nontrivial overlap of the rules -Ri and R2 at a subexpression s if (a) t is 
an instance of the lefthand side of Ri , (b) s is an instance of the lefthand side of 
i?2 and also does not occur within the instantiation of a variable on the lefthand 
side of Ri when this is matched with t and (c) either s is distinct from t or Ri is 
distinct from R2. Let ri be the expression that results from rewriting t using Ri 
and let r2 result from t by rewriting s using R2. Then the pair (ri, r2) is called the 
conflict pair corresponding to the overlap in question. Relative to these notions, 
the lemma can be proved by establishing the following simpler property: for every 
conflict pair corresponding to the reading and merging rules, it is the case that the 
two terms can be rewritten to a common form using these rules. 

In completing this line of argument, the nontrivial overlaps that we have to 
consider are those between (ml) and each of the rules (rl)-(r6), between (ml) 
and itself and between (m2) and (m3). The last of these cases is easily dealt 
with: the two expressions constituting the conflict pair are identical, both be- 
ing nil. The overlap between (ml) and itself occurs over a term of the form 
IIP, o?i, n/i, d], 0^2, «^2, 62!, o?3, n/3, 63]. By using rule (ml) once more on each 
of the terms in the conflict pair, these can be rewritten to expressions of the form 
ft, ol', nl', e'] and ft, ol", nl" , e"], respectively, whence we can see that ol' = ol" and 
nl' = nl" by simple arithmetic reasoning and that e' and e" reduce to a common 
form using Lemma 3.10. The overlaps between (ml) and the reading rules are also 
easily dealt with. For instance consider the case of (ml) and (rl). Using rule (rl), 
the two terms in the conflict pair can be rewritten to the same constant. The other 
cases are similar even if a bit more tedious. □ 

As observed already, the main result of this subsection follows directly from 
Lemma 3.11 and Theorem 3.6. 

Theorem 3.12. The relation Oj-m is confluent. 

The uniqueness of l>r»n-iiormal forms is an immediate consequence of Theo- 
rem 3.12. In the sequel, a notation for referring to such forms will be useful. 

Definition 3.13. The notation \t\ denotes the >*^-normal form of a suspension 
expression t. 

It is easily seen that the l>r-m-normal form for a term that does not contain meta 
variables is a term that is devoid of suspensions, i.e., a de Bruijn term. A further 
observation is that if the all the environments appearing in the original term are 
simple, then just the reading rules sufHce in reducing it to the de Bruijn term that 
is its unique Oj-^-normal form. 

3.3 An Equivalence Property Relating to Renumbering Substitutions 

An important role for the subcalculus for substitutions is that of realizing the 
renumbering of de Bruijn indices necessitated by beta contractions. One mechanism 
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t 



t 



e 



e 



ti ^ t'^ t2 t2 



t^t' e ~ e' 



ti t2 ~ t[ t'^ 



(t, n) :: e ~ (t', n) :: e' 



ei ~ e'l 62 ~ 62 



At ~ At' 



t ~ t' e ~ e' 



[t,oZ,nZ,e] ~ p',oZ,nZ,e'] 



(t,n) ~ (t',n) 




(p,oi,nZ,r],nZ + fc) :: e ~ ([*', oZ,nZ', r'; 



'},nl' + k) :: e' 



Fig. 2. The similarity relation, 



for controlling such renumbering is the new embedding level in a suspension, i.e., 
the value chosen for nl in an expression of the form oZ,nZ,e]. Looking at the 
reading rule (r3), wc sec that another component that determines renumbering is 
the index of an environment term, i.e., the value chosen for n in an item of the form 
{t, n) in an environment. Now, these different mechanisms appear in juxtaposition 
in an environment item of the form o?, n/, e], n). We observe here that >rm- 
normal forms are invariant under a coordinated readjustment of the renumbering 
burden between the two devices in such an expression. 

The permitted reapportionment is expressed formally through the notion of sim- 
ilarity defined below. 

Definition 3.14. The similarity relation between (well- formed) terms and envi- 
ronments, respectively, is denoted by ~ and is given by the rules in Figure 2. 

The property of interest is then the following: 

Theorem 3.15. Ift and t' are terms such that t <^t' , then \t\ = \t'\. If e and e' 
are environments such that e ~ e', then they rewrite by reading and merging rules 
to similar simple environments. 

Proof. Only a sketch is provided here; a detailed proof may be found in [Gacek 
2006b]. Using the translation function from Definition 3.5, we define the relation 
» on suspension expressions as follows: u 3> (' just in case £{u) >- £{v). Obviously 
» is a well-founded partial order. It is also easily seen that w » if either w is a 
sub-expression of u or u>rm?;. 

The argument is now an inductive one based on the ordering induced by ^> on 
pairs of expressions. In filling out the details, when considering two expressions u 
and V such that u ^ the additional properties of » and the induction hypothesis 
allow us to assume that any similar subparts of u and v that are terms are identical 
and that are environments are simple. We then consider the difi'erent cases for the 
structures of u and v and the rewriting rules that are applicable to them. The only 
nontrivial case when u and v are terms arises when these are suspensions to which 
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rule (r3) is applicable and the environment parts of these terms are similar but not 
identical. In this case we have 

u = ol,nl, {ltr,olr,nlr,r},nlr + k) :: e| 

>(r3) Iftrj Olr, Tilr, ^J, 0, Til — {ulr + fc), nil\ 

l>(ml) I^r, O^r, nl — {nlr + k) + Tllr, ^V, Tllr, 0, mZ J] 

[>(m2) ltr,0lr,nl - fc, r| 

V = ol,nl,{ltr,olr, nil, r'},nl'^ + k) ■■ e'l 
>(r3) llU, olr,nl'^,r'j,0,nl - {nl'^ + k),nilj 
>(mi) Pr, olr, nl - {nl'^ + k) + nl'^, Ir', nl'^, 0, nil}} 

>im2) ltr,0lr,nl - fc, r'] 

By assumption, r ~ r'. Since u ^ ftr, olr, nl — k, r] and v ^ ftr, ol^, nl — k, r'J, the 
induction hypothesis yields the desired conclusion. For environments, the nontrivial 
cases arise when u and v are of a form to which the rules (m5) or (m6) apply. The 
argument here is similar albeit more tedious. □ 

Theorem 3.15 casts an interesting light on rule (m6) of the suspension calculus. 
This rule has the form 

{{t, n) :: ei, n, oh, {s, I) :: 62 J 

(p, ol2.,L {s, I) :: 62}, m) :: {d, n, 0I2, {s, I) :: 623- 

where m = I + {n — oh). The righthand side of the rule has an environment item 
in which both an index and a new embedding level is chosen. Observe that a value 
larger than I could also be used for the new embedding level so long as the index is 
correspondingly modified and it remains consistent with the context in which the 
replacement is performed. Intuitively, this would correspond to eagerly relativizing 
ft, 0I2, 1, (s, I) :: 62] to a context with a larger number of enclosing abstractions and 
taking cognizance of this in its subsequent substitution. 

3.4 Confluence for the Full Calculus 

Now we turn to the confluence of the system given by the rules in Figure 1 that 
includes the {f3s) rule in addition to the ones for interpreting substitutions. In 
establishing this property, we adopt the method used in [Curicn et al. 1996] to 
demonstrate that the Acr-calculus is confluent. The following lemma, proved in 
[Curien et al. 1996], is a critical part of the argument. 

Lemma 3.16. Let TZ and S be two reduction relations defined on a set X with 
TZ being confluent and strongly normalizing and S satisfying the property that for 
every t, u and v such that tSu and tSv there is an s such that uSs and vSs. 
Further suppose that for every t, u and v such that tSu and tTZv there is an s 
such that uU* s and V {11* US UTZ*) s. Then the relation n*LlSuTZ* is confluent. 

In applying this lemma, we shall utilize the parallelization of t>p^ that is defined 
below. 

Definition 3.17. The relation >g^|| on suspension expressions is defined by the 
rules in Figure 3. 
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t — » t e ^ e 

ti -> t[ t2 -> t'2 t-^t' e -» e' 

tl t2 t'l (t, e -> (f, /) :: e' 

ei — > ej 62 —» 62 
|ei,nii,oi2,e2j — > |ej,nii, 0^2, 

t ^ t' e ^ e' 
\t,ol,nl,e\ — > [t',oi,nZ,e'] 

tl ^ t'l t2 ^ 
(Ati) t2 ^ [t'l, 1,0, (i2,0) :: nil} 

Fig. 3. Rules defining \>p^\\ 

Theorem 3.18. The relation >rmi3s *^ confluent. 

Proof. Let 1Z be >rm and let <S be 1>^^||. We observe then that 

>™/33C(7^*u5u7^*)c>;^^^. 

Thus {1l*USun*)* = >*^f^^ and hence O^m/s, would be confluent if (7^* U<Su7e*) 
is. 

To establish the latter we use Lemma 3.16, interpreting TZ and S as per the 
nomenclature of the lemma. We have already seen that l>rm is both confluent 
and strongly normalizing. To show that if t^^^nu and t[>^^||U then there is an s 
such that ul>^^||S and t;l>^^||S, we argue by induction on the structure of t and by 
considering the rules by which u and v are obtained. The only non-trivial case is 
that when t is the term (Xti) t2, one of u and v is [t'l, 1, 0, (<2, 0) :: nilj and the 
other is (At") ^2 where til>p^^\t[, ^i^g^n^i, t2>i3^\\t'2 and t2l>/3,||t2- ^y the induction 
hypothesis, there exists an si such that t'i[>^^||Si and t']'>^^||Si and an S2 such that 
^2^AII"^2 and t2'>/3s||*2- We then pick s as |si, 1, 0, (s2, 0) :: nil}; obviously u>^^||S 
and v\>i3^i\s. 

It only remains for us to show that for any t, u and v such that t>a,||M and 
tOrrnV thcrc is an s such that m[>*^s and w U l>/3,|| U s. Wc do this again 
by induction on the structure of t. The argument is straightforward in all cases 
except perhaps when t is |(Afi) t2,ol,nl,el, v is |Ati, oZ,n/,e] |t2, oZ,nZ,e] and u 
is 1^1,1,0,(^2,0) :: nil},ol,nl,e'} where ti>/3^||t'i, t2t>/3^\\t'2 and O^^ye'. However, 
if we pick s to be 

lt[,ol + l,nl, {lt'2,ol,nl,e'j,nl) :: e'J 

we can easily show that it satisfies the requirements, thus completing the argument 
even in this case. □ 

Theorem 3.18 strengthens the confiuence result established for the original sus- 
pension calculus in [Nadathur and Wilson 1998] in that it shows that this property 
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holds even when meta variables are permitted in terms. Although we have only 
shown this property to hold for the refinement of the suspension calculus presented 
here, our argument can be easily adapted to the original version. 

3.5 Simulation of Beta Reduction 

A fundamental requirement of any explicit substitution calculus is that it should 
allow for the simulation of beta reduction in the usual A-calculus. In framing this 
requirement properly for the suspension calculus, it is necessary, first of all, to re- 
strict attention to the situation where meta variables do not appear in terms. In 
this setting, as observed already, the lambda calculus terms under the de Bruijn no- 
tation are exactly those suspension terms that are devoid of suspensions. Moreover, 
beta contraction, denoted by >/3, is defined as follows: 

DefinMion 3.19. Let f be a dc Bruijn term and let si,S2,S3,... represent an 
infinite sequence of de Bruijn terms. Then the result of simultaneously substituting 
Si for the i-th free variable in f for i > 1 is denoted by S{t; si, S2, ss, . . .) and is 
defined recursively as follows: 

(1) S{c; si,S2, S3, . . .) = c, for any constant c, 

(2) 5'(#z; Si, S2, S3, • • •) = Si for any variable reference #i, 

(3) S{{ti t2); si,S2, S3, . . .) = {S{tr, si, S2, S3, . . .) S{t2; si, .S2, ,S3, • • •)), and 

(4) S{{Xt);si,S2,S3,...) = {XS{t;#l,s[,s'2,s's,...)) where, for i > 1, 
s^ = 5(s,;#2,#3,#4,...). 

Using this substitution operation, the /3-contraction rule is given by the following 

((Ati) t2) ^^(ii;i2,#l,#2,...). 

A de Bruijn term t is related via /3-contraction to s if s results from t by the 
application of this rule at an appropriate subterm. We denote this relationship by 
>/?. Beta reduction is the reflexive and transitive closure of >/3. 

One part of the relationship between the suspension and lambda calculi that may 
also be viewed as the soundness of the {f3s) rule is the following: 

Theorem 3.20. Lett and s be suspension terms such that t\>0^s. Then \t\l>*p\s\. 

Proof. This theorem is proved for the original suspension calculus in [Nadathur 
and Wilson 1998]. The result carries over to the version of the calculus presented 
here by virtue of Lemma 2.9. □ 

The ability of the suspension calculus to simulate beta reduction is a suitably 
stated converse to the above theorem. 

Theorem 3.21. Let t and s be de Bruijn terms such that tt>*^s. Then t >*^i3^s. 

Proof. It has been shown in [Nadathur and Wilson 1998] for the original for- 
mulation of the suspension calculus that if tl>^s then i>*^^s. This observation 
carries over to the present version since the rules defining \>ri3s have essentially 
been preserved. The theorem obviously follows from this. □ 
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4. COMPARISON WITH OTHER EXPLICIT SUBSTITUTION CALCULI 

We now survey some of the other explicit treatments of substitutions that have been 
proposed and contrast them with the suspension calculus. We restrict our atten- 
tion in this study to calculi that utilize the de Bruijn scheme for representing bound 
variables. A good approach to imdcrstanding such calculi is to characterize them 
based on properties that are desired of them over and above their ability to encode 
substitutions. These are three such properties in our understanding: the ability 
to compose reduction substitutions, confluence in a situation where graftablc meta 
variables are included and the preservation of strong nor maliz ability for terms in 
the underlying lambda calculus. The first of these properties is central to combining 
substitution walks in normalization. Without it, for instance, the reduction of the 
term (AAti) h would require two separate traversals to be made over ti for the 
purpose of substituting and ts for the relevant bound variables in it. The sec- 
ond property is important in developing algorithms that exploit the grafting view 
of meta variables. For example, confluence in the presence of such variables is a 
central requirement in realizing a new approach to higher-order unification [Dowek 
et al. 2000]. The final property has both a theoretical and a practical significance. 
At a theoretical level, it measures the coherence of the calculus. Explicit treatments 
of substitution are obtained usually by adding a terminating set of rules for car- 
rying out the substitutions generated by beta contractions. The non-preservation 
of strong normalizability should, in this setting, be read as an undesirable interfer- 
ence between different parts of the overall rewrite system. At a practical level, this 
signifies that caution must be exercised in designing normalization procedures. 

Of these various properties, the one that appears to be most important in prac- 
tice is the ability to combine reduction substitutions: studies show that it is central 
to the efficient implementation of reduction [Liang et al. 2004] , and, as indicated in 
Section 2, it also appears to be a natural way to realize confluence in the presence 
of graftable meta variables. Unfortunately, the majority of the explicit substitution 
calculi seem not to include this facility. Particular calculi sacrifice other properties 
as well. The Au-calculus [Benaissa et al. 1996] preserves strong normalizability but 
does not permit graftable meta variables. The Ase-calcuhis permits such variables 
and is confluent even with this addition [Kamareddine and Rfos 1997] but does not 
preserve strong normalizability [Guillaume 2000]. The AC-calculus [Munoz 1996] 
possesses both properties but obtains confluence by effectively requiring beta re- 
dexes to be contracted in an innermost fashion. Amongst the systems that do not 
permit the combination of substitutions, the A^g^-calculus alone preserves strong 
normalizability and realizes confluence in the presence of graftable meta variables 
without artificially limiting reduction strategies [David and Guillaume 2001]. 

The only systems that permit the combination of reduction substitutions are, 
to our knowledge, the Au-calculus [Abadi et al. 1991], the closely related ACCL 
calculus [Field 1990] and the suspension calculus. The first two calculi are prac- 
tically identical and, for this reason, we restrict our discussion of them to only 
the Aa-calculus. The suspension and the Aa-calculus both admit graftable meta 
variables without losing confluence and they are similar in many other respects as 
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well^. However, they have two important differences. One of these relates to the 
manner in which they represent substitutions. The Acr-calculus encodes these as 
independent entities that can be separated from the term that they act on. This is 
a pleasant property at a formal level but it also leads to inefficiencies in the treat- 
ment of the renumbering of bound variables that is necessary when a substitution 
is moved under an abstraction. The second difference concerns the treatment of 
bound variables. In the Aa-calculus, these are encoded as environment transform- 
ing operators in contrast to their representation directly as de Bruijn indices in 
the suspension calculus. The former representation is parsimonious in that rules 
that serve to compose substitutions can also be used to interpret bound variables. 
However, there are also disadvantages to such parsimony. It appears more difficult, 
for example, to separate out rules based on purpose and, hence, to identify simpler, 
yet complete, subsystems as has been done for the suspension calculus [Nadathur 
1999] . The ambiguity in function also appears to play a role in the non-preservation 
of strong normalizability in the Acr-calculus [Mellies 1995]: although the status of 
this property for the suspension calculus is as yet undetermined, a more focussed 
treatment of substitution composition disallows the known counterexample for the 
Acr-calculus to be reproduced within it. 

In the rest of this section we use the suspension calculus as a means for under- 
standing the different treatments of explicit substitutions in more detail. We also 
attempt to substantiate the qualitative comparisons that wc have provided above. 
Our approach to doing this is to describe translations between the suspension calcu- 
lus and the other calculi that illuminate their differing characteristics. None of the 
calculi that we consider treat constants in terms and, for the sake of consistency, 
we assume these are missing also in suspension terms. We also do not include meta 
variables initially since these arc not present in all calculi, but we bring them into 
consideration later as relevant. Wc divide our discussion of the other calculi into 
two subsections depending on whether or not they possess an ability to combine 
substitutions. As we shall see below, the calculi that do not have a combining 
capability correspond substantially to the suspension calculus without the merging 
rules. 

4.1 Calculi Without Substitution Composition 

We discuss three calculi under this rubric: the Au-calculus [Benaissa et al. 1996], 
the As-calculus [Kamareddine and Rios 1995], and the Asg-calculus [Kamareddine 
and Rfos 1997]. Qualitatively, these calculi provide an increasing sequence of capa- 
bilities. When the de Bruijn representation is used for lambda terms, the indices of 
externally bound variables in a term have to be incremented when it is substituted 
under an abstraction. The A-u-calcuhis requires such renumbering to be carried 
out in separate walks for each abstraction that the term is substituted under. The 
As-calculus improves on this situation by permitting all the renumbering walks to 
be combined into one although such a walk is still kept distinct from walks that 
realize substitutions arising out of beta contractions. The Asg-calculus extends the 



^To be accurate in spirit as well as in detail this statement needs a qualification: as wc discuss 
later in the section, the original rewrite system of the Acr-calculus needs to be extended slightly 
to obtain confluence in the presence of graftable meta variables. 
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(B) 


(Xa) a[b/] 


(VarShift) 


n[T] ^ n + 1 


(App) 


(a b)[s] als] b[s] 


(FVarLift) 


iim] ^ 1 


(Lambda) 


(Aa)[s] ->Aa[ir(s)] 


(RVarLift) 


n + l[-fl-(s)] -»nM[T] 


(FVar) 


l[a/] a 






(RVar) 


n + l[a/l — » n 







Fig. 4. Rewrite rules for the Au-calculus 

As-calculus by permitting graftable meta variables. 

4.1.1 The \v-calculus. The syntax of this calculus comprises two categories: 
terms, corresponding to lambda terms possibly encoding explicit substitutions, and 
substitutions. 

Definition 4.1. The terms, denoted by a and h and the substitutions, denoted 
by s, of the Af-calculus are given by the following syntax rules: 

a ::= n\ a b \ \a \ a[s\ 
s ::= a/ \ i[{s) \ T 

The collection of expressions described may be understood intuitively as follows. 
The expression n represents the n*'' de Bruijn index, analogously to #n in the 
suspension calculus. The binary operator _[_], referred to as a closure, introduces 
explicit substitutions into terms. The expression a/, created using the operator 
/ called slash, represents the substitution of a for the first dc Bruijn index and a 
shifting down of all other de Bruijn indices. The substitution ^(.s), which uses the 
operator ff called lift, provides a device for pushing substitutions underneath ab- 
stractions. Finally, the expression f, called shift, represents the effect of increasing 
the de Bruijn indices corresponding to externally bound variables by one. 

The interpretations of the various syntactic devices are made explicit by the rules 
in Figure 4 that define the Au-calculus. The rule labelled (B) in this collection 
emulates beta contraction by generating an explicit substitution. The rest of the 
rules, that constitute the sub-calculus v, serve to propagate such substitutions over 
the structure of a lambda term and to eventually evaluate them at the bound 
variable occurrences. 

In relating the suspension and the Ai;-calculus it is natural to identify the syn- 
tactic categories of terms in the two settings and to think of environments in the 
former framework as corresponding to substitutions in the latter. There is, how- 
ever, an important difference in the view of the latter two entities. Substitutions in 
the Au-calculus are self-contained objects that carry all the information needed for 
understanding them in context. In contrast, the interpretation of an environment 
requires also an associated old and new embedding level in the suspension calculus. 
This intuition underlies the following translation of Xv to suspension expressions. 

Definition 4.2. The mappings T from terms in the Aw-calculus to terms in the 
suspension calculus and E from substitutions in the Aw-calculus to triples consisting 
of two natural numbers and a suspension environment are defined by recursion as 
follows: 
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(1) For a term t, T{t) is #n if Hs n, {T{a) T{h)) if Hs (a h), \ T{a) if Ms A a, and 
|T(a), ol, nl, e| if t is a[s] and -B(s) = {ol, nl, e). 

(2) For a substitution s, £;(s) is (1, 0, (T(a), 0) :: nil) if s is a/, (0, 1, nil) if s is t, 
and {ol + l,nl + 1, {#l,nl + 1) :: e) if s is tr(s') and E{s') = {ol,nl,e). 

It is easy to see that T{a) must be a well-formed suspension term for every term 
a in the Af-calculus. The difference in representation of bound variables in the 
two calculi is clearly only a cosmetic one and wc shall ignore it in the discussion 
that follows. It is obvious then that T is a translation that preserves de Bruijn 
terms. It can also be easily verified is that T and E are one-to-one mappings. 
There are, however, many suspension terms that are not the images under T of any 
term in the Aw-calculus: the set of substitutions that can be encoded in the latter 
calculus is quite limited. There are, in fact, only two forms that substitutions can 
take: f|~(. . . i[{a/) . . .), corresponding to preserving the first few do Bruijn indices, 
substituting a (with appropriate renumbering) for the next one and decreasing the 
remaining indices by one, and . . f|~(t) • ■ •), corresponding to preserving the first 
few de Bruijn indices and then incrementing the remaining ones by one. Thus, the 
Af-calculus cannot encode an expression such as ft, 0, 2, nil], where t is a de Bruijn 
term, directly. This expression can be represented indirectly by t[t][T] that has 
the suspension term 0, 1, niZ], 0, 1, nil] as its image. This encoding highlights a 
problem with the manner in which the Af-calculus treats renumbering of de Bruijn 
indices: incrementing by n has to be realized through n separate walks that each 
increment by 1 . A more drastic example of the limitations of the Aw-calculus is that 
it possesses no simple way to encode the suspension term {t, 1,2, (s, 2) :: nil} that 
corresponds to substituting s for the first de Bruijn index in t and incrementing 
all the remaining indices by two. Finally, wc note that only simple environments 
appear in terms that are in the image of T. This is, of course, to be expected since 
the the Au-calculus does not support the ability to compose substitutions. 

At the level of rewriting, we would expect the Aw-calcuhis to translate into the 
subcalculus of the suspension calculus that excludes the merging rules. This is true 
for the most part: it is easily seen that if Z — > r is an instance of any rule in Figure 4 
other than (FVar) and (RVarLift), then T{1) T{r) is an instance of either the 
{(3s) rule or one of the reading rules in Figure 1. For the (FVar) rule, we observe 
first that the {t, 0, 0, nil] — > t is an admissible rule in the suspension calculus in the 
absence of graftablc meta variables. Now, this fact can be used to build a special 
case of (r3) into the rewrite system: 

(r3') [#l,oZ,0,(i,0)::e]^t 

The (FVar) rule corresponds directly to (r3') under the translation we have de- 
scribed. 

The situation for the (RVarLift) rule is more involved. Any term that matches 
its lefthand side translates into a suspension term of the form 

[#(n + 1), ol + l,nl + 1, {#l,nl + 1) :: e] 

where either e is nil, in which case ol is and nHs 1, or e has a first element of the 
form {t,nl). In the suspension calculus, rule (r4) allows this term to be rewritten 
to the form 
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[#n,o/,T7i + l,e]. 

In the case that e is nil, this suspension corresponds to incrementing the indices for 
externally bound variables in a de Bruijn term, constituted here by #n, by 2. If e is 
of the form (t, nl) :: e' on the other hand, then the suspension represents a situation 
in which one or more terms are to be substituted into a context that includes more 
enclosing abstractions than were present in the context of their origin. The Xv- 
calculus is capable of representing neither situation directly but can encode both 
indirectly via a term that translates to 

||#n, o;,n/,e],0,l,mZ]. 

This is, in fact, the translation of the righthand side of the (RVarLift) rule. This 
term can be reduced to |#n, oZ, nZ + 1, e] by using the merging rules but represents 
the introduction of an extra renumbering walk in the absence of these rules. 

The above discussion casts light on the efficiency with which beta reduction can 
be realized using the two calculi considered here. Normal forms for suspension 
expressions involving only simple environments are identical whether or not the 
merging rules are utilized. From this it follows easily that the normal forms pro- 
duced by the two systems must be identical. 

4.1.2 The Xs-calculus. The As-calculus also distinguishes between beta contrac- 
tion and renumbering substitutions. However, it differs from the Au-calculus in that 
it possesses a more general mechanism for renumbering de Bruijn indices and also 
has a more concise way of recording which de Bruijn indices are actually affected 
by beta contraction and renumbering substitutions. These devices are manifest in 
the syntax of terms. 

Definition 4.3. The terms of the As-calculus, denoted by a and b, are given by 
the rules 

a ::= n\ab\Xa\aa^b \ ^p\a 

where n and i range over positive integers and k ranges over non- negative integers. 

Towards understanding this syntax, we observe first that de Bruijn terms are rep- 
resented in the As-calculus exactly as they are in the suspension calculus with the 
cosmetic difference that the n^^ de Bruijn index is denoted directly by n rather 
than #n. Beyond this, there are two additional kinds of expressions that serve to 
make substitutions explicit. A term of the form a a* b, called a closure and intended 
to capture a beta contraction substitution, represents the substitution of a suitably 
renumbered version of b for the i*^ de Bruijn index in a and a shifting down by one 
of all de Bruijn indices greater than z in a. A term of the form ^p\ a, called an update 
and included to treat renumbering, represents an increase by i — 1 of all de Bruijn 
indices greater than k. The purpose of these new kinds of expressions becomes clear 
from the rewriting rules for the As-calculus that are presented in Figure 5. The 
a-generation rule is the counterpart of beta contraction in this collection. The re- 
maining rules, referred to collectively as the s rules, serve to calculate substitutions 
introduced into terms by applications of the a-generation rule. 

Closures and updates can be understood as special forms of suspensions. This 
relationship is made precise by the following definition. 
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a-generation (-^ h ^ aa^ b 

a-\-transition (Xa) a' b X (a a^'^^ b) 

a-app-transition (ai 02) a'^ b ^ (ai tr* b) (02 6) 

!n — 1 if n > i 
(Pgb if n = i 
n if n < i 

ip-\-transition ip^(Aa) ^ A (ip^^^-^ a) 

(fi-app-transition Vki'^i '^2) — * ('/'fe'^l) (v^'^a) 

\ n + i — 1 if n > k 
(p-destruction ~* ^ 

In if n < fe 

Fig. 5. Rewrite rules for the As-calculus 

Definition 4.4. The translation T of terms in the As-calculus to suspension terms 
is defined by recursion as follows: 

' #n if t is n 

T{a) T{b) if i = (a b) 

XT{a) i{t = Xa 
T{t)={ lT{a),i,7.-l,m,i-l) :: 

(#l,i-2) (#1,1) :: (T(6),0) :: nil} iit = aaHa.nd 

lT{a),k,k + i-l,{#l,k + i-l) :: 

(#1, k + i -2) (#1, i) :: nilj H t = a. 

The image of the translation function T is, once again, evidently a subset of the 
well-formed suspension terms. At a rewriting level, the As-calculus is, in a sense, 
contained within that fragment of the suspension calculus that excludes the merging 
rules. Towards making this comment precise, we observe first that the following 
is a derived rule of this fragment of the suspension calculus, assuming that e is a 
simple environment: 



l#n,ol,nl,el = < 



^{n — ol + nl) if n > oZ, 

#{nl — l + l) if n < ol and e[n] = (#1, 1), and 
^ ft, 0, nl — I, nil} otherwise, assuming e[n] = {t, I). 



In particular, this rule embodies a sequence of applications of the rules (r2)-(r4) 
from Figure 1. Now, if we augment the reading rules to also include this rule, then 

the following theorem is easily proved: 

Theorem 4.5. If a and b are terms of the Xs-calculus such that a rewrites to b 
in one step using the rules in Figure 5, then then T{a)>ri3^T{b). 

Noting that de Bruijn terms are preserved under the translation, we see then that 
any normalization sequence in the As-calculus can be mimicked in a one-to-one 
fashion within this fragment of the suspension calculus. 

The comments above indicate a correspondence at a theoretical level but they 
gloss over issues relevant to the practical implementation of reduction. First, as 
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a-a-transition 



a-ip-transition 1 

cr-ip-transition 2 
if-a-transition 



(f-ip-transiticm 2 



f-if-transiticm 1 



(ao-' fe) CT-' c ^ (ao-J+i c) o-'(6ct-'-'+i 
((^^ a)a^ b -+ vj,"^ a 

V'KW «) ^ "^i ('^fe+l-j ") 



if i < j 

iik<j<k + i 



ii k + i < j 
if i < fc + 1 
if ; + j < fe 



if i < < / + j 



Fig. 6. Additional rewrite rules for the Ase-calculus 



the translation function indicates, the As-calculus provides a rather succinct en- 
coding for the substitutions that arise when only the reading and the /?s rules are 
used. Second, the s rules utilize this representation to realize substitution rather 
efficiently in this context; observe, in this regard, that the derived reading rule actu- 
ally embodies a possibly costly "look-up" operation that is necessary relative to the 
more elaborate encoding of substitutions used in the suspension calculus. However, 
this efficiency has an associated cost: closures in the As-calculus represent exactly 
one beta contraction substitution and, consequently, multiple such substitutions 
must be effected in separate walks. By contrast, even simple environments in the 
suspension calculus have the flexibility for encoding multiple beta contraction and 
arbitrary renumbering substitutions. Moreover, the merging rules are not needed 
in their full generality to exploit this capability: simple to implement derived rules 
can be described for this purpose [Nadathur 1999]. It has been observed that the 
ability to combine substitutions that is supported by the more general encoding for 
them leads to significantly greater efficiency in realizing reduction in practice than 
does the concise encoding facilitated by treating restricted forms of substitutions 
[Liang et al. 2004]. 

4.1.3 The Xsg-calculus and perm,uta,tions of substitutions. The As-calculus and 
the Aw-calculus lack confluence in the presence of graftable meta variables. In 
the absence of substitution composition, the only way to regain confluence is to 
permit permutations of substitutions"''. In the context of the As-calculus, such 
permutability should apply to both the closure and the update forms of explicit 
substitutions. The Ase-calculus adds the rules in Figure 6 to those already present 
in the As-calculus in support of such permutability. There must, of course, be 
some kind of directionality to the permitted substitution reorderings to ensure 
termination and the side conditions on the new rules are intended to realize this. 
To understand the use of these rules and also the restrictions on permutations, we 
may consider the term ((A ((AX) ti)) ^2)- Mimicking in the Ase-calculus the two 
reduction paths seen for this term in Section 2.4, we get the terms {X t\) t2 
and (X a'^ t2) <7^{ti t2). Notice now that the a-a-transition rule is applicable 
only to the first of these terms. Thus, intuitively, this rule permits the permutation 
only of substitutions arising from the contraction of outer beta redexes over those 
arising from contracting inner ones. The effect of carrying out this rearrangement 



^We note here that permutation and composition of substitutions are distinct notions although 
they seem sometimes to have been confused in the literature, e.g., see [Cosmo et al. 2003]. 
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is to make the substitutions have the same form in both terms, as is desired. 

The Ase-calculus has been shown to have an adequate mix of permutation rules to 
ensure confluence in the presence of meta variables [Kamarcddinc and Rios 1997]. 
From the discussion of the a-a-transition rule it might appear that it also restricts 
these rules sufficiently to preserve strong normalizability. Unfortunately, this is not 
the case: it has been shown that interactions between closures and updatings can 
give rise to nontermination even when the starting point is a lambda term that 
can be simply typed [Guillaume 2000]. The A,„s-calculus [David and Guillaume 
2001] provides a remedy to this situation by extending the syntax of de Bruijn 
terms (and hence the normal forms produced by reduction) to include terms with 
numeric labels that represent yet- to-be-computed renumbering substitutions. 

4.2 Calculi with Substitution Composition 

As we have noted, the main exemplars of this variety of treatment of explicit sub- 
stitutions arc the Act- and the suspension calculi. Wc discuss their relationship 
below. In contrast to the earlier situations, it is now relevant to consider mappings 
between these calculi in both directions. 

4.2.1 The \(j -calculus. The Acr-calculus, like the Au-calculus that is derived 

from it, treats substitutions as independent entities that can be meaningfully sep- 
arated from the terms they act upon. Thus, its syntax is determined by terms and 
substitutions. 

Definition 4.6. The following syntax rules in which a and b denote terms and s 
and t denote substitutions define the syntax of the Acr-calculus: 

a ::= 1 | a 6 | Aa | a[s\ 
s ::= id \ a ■ s \ s o t \ f 

A term of the form a[s] is called a closure and represents the term a with the 
substitution s to be applied to it. The substitution id is the identity substitution. 
The substitution a • s is called cons and represents a term a to be substituted for 
the first de Bruijn index along with a substitution s for the remaining indices. 
The substitution sot represents the composition of the substitution s with the 
substitution t. Finally, the substitution t is called shift and is intended to capture 
the increasing by 1 of all the de Bruijn indices corresponding to the externally 
bound variables in the term it is applied to. A form of substitution that has 
special significance ist°(T° •■■ (Tot)"")- Assuming n occurrences of t in 
the expression, such a substitution represents an n-fold increment to the de Bruijn 
indices of the externally bound variables in the term it operates on. The shorthand 
t" is used for such an expression and the notation is further extended by allowing 
t'^ to denote id. 

The reference to dc Briiijn indices in the previous paragraph is accurate in spirit 
but not in detail. The Acr-calculus represents abstracted variables as environment 
transforming operators rather than as indices. Specifically, only the first abstracted 
variable is represented directly by the index 1: for n > 1, the n-th such variable; is 
represented by When such a term is subjected to a substitution, the shift 

operators will play a role in determining the appropriate term to replace it with, as 
the rules of the calculus will elucidate. It will become clear then that composition 
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(Beta) (Ao) 6 ^ a[fe ■ id] 



(App) [a b)[s] ^ a[s] b[s] 

(Abs) (Aa)[s] ^ Aa[l • (s o T)] 

(Varld) l[id\^l 

(VarCons) 1 [a • s] — > a 

(Clos) a[s][t] -> a[sot] 



(Map) (a • s) o t -» o[t] - {sot) 

(Ass) (s o t) o u — > s o (t o u) 

(IdL) ido s —f s 

(Shiftid) 1 oid-*1 

(ShiftCons) T o (a • s) ->■ s 



Fig. 7. Rewrite rules for the Ac-calculus 



of substitutions is essential in this calculus even to the proper interpretation of 
variables bound by abstractions. 

The rules that define the Acr-calcuhis are presented in Figure 7. In this collection, 
the (Beta) rule serves to simulate beta contraction. The remaining rules, that define 
the subsystem a, are meant to propagate substitutions generated by the (Beta) 
rule. The a rules in the left column compute the effect of substitutions on terms. 
The (Clos) rule may generate a composition of substitutions in this process that 
the rules in the right column are useful in unravelling. Given two terms or two 
substitutions ?y, and v . we write ?i Ox^r v or u [>„ v to denote the fact that v results 
by replacing an appropriate subpart of u using any of these rules or only one of the 
a rules, respectively. The reflexive and transitive closure of these relations is, as 
usual, denoted by [>^^ and >*. 

It is useful to understand the manner in which the rules of the Acr-calculus 
function in the task of normalizing terms as a prelude to contrasting it with 
the suspension calculus. Towards this end, consider the lambda term given by 
(A A ((A A A #3) #2)) in the suspension calculus. This term is encoded by 

(AA((AAAl[f])l[r])) 

in the Acr-calculus. Applying the (Beta) rule to the only redex in this term we get 

(AA((AAl[f])[l[T]-id])). 

The substitution generated by beta contraction can now be moved inside the two 
abstractions using the (Abs) rule to get the term 

(AAAA(l[fl[l •(! •((![?] -id) ot) ot)]). 

The substitution (1 • ((l[t] • id) o |) o |) that appears in this expression depicts 
the iterated adjustment of substitutions as they are pushed under abstractions in 
Acr-calculus; by contrast, the suspension calculus captures the needed renumbering 
simply by a global adjustment to the new embedding level. The next conceptual 
step in the reduction is that of "looking up" the binding for the variable represented 
by l[t^] in the substitution. This step requires the possible use of (ShiftCons) to 
prune off an initial portion of the substitution and an eventual use of (Varld) to 
select the desired term. However, the encoding of abstracted variables necessitates 
the use of the rules (Clos), (Ass) and (Map) to prepare the situation for applying 
these rules. The term that results at the end of this process is (A A A A l[(t o |) o |]). 
The (Ass) rule can now be used to transform the term under all the abstractions 
into the form 1[| o (| o |)] that is recognizable as the encoding of a de Bruijn index. 
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4.2.2 Translating suspension expressions into Xa -expressions. The non-trivial 
part of this mapping concerns the treatment of environments in the suspension 
calculus. Intuitively, these must correspond to substitutions in the Acr-calculus. 
However, environments obtain a meaning only relative to the new embedding level of 
the suspension terms they appear in. Moreover, to be well-formed, this embedding 
level must be at least as large as the level of the environment itself. Once this 
constraint is satisfied, the example just considered suggests the right translation to 
a "standalone" substitution. 

Definition 4.7. The mappings S from suspension terms to Acr-terms and R from 

pairs constituted by a suspension environment e and a natural number i such that 
lev{e) < i to Acr-substitutions are defined simultaneously by recursion as follows: 

(1) = 1, S'(#(n+1)) = l[r] if n > 0, S{a b) = {S{a) S{b)), S{Xa) = XS{a) 

and S{lt,ol,nl,el) = S{t)[Rie,nl)]. 

i occurrences of T 



(2) R{e,i)={ 



(. . . {{id o I) o I) • • • ) o I if e = nil 

i—n occurrences of t 



(. . . {iiS{t) ■ R{e', n)) o t) o t) • • • ) ° T if e = n) :: eJ and 
i?(ei, n/i) o R{e2,i - {nli - 0I2)) if e = -g^ei, nli, 0/2, (52}}- 



The constraint on the pairs that R applies to raises a question concerning the well- 
definedness of /?, and hence also of S. However, the well-formedness requirement 
on suspension expressions in Definition 2.3 ensures that these must be well-defined. 
Another fact that is easy to verify is that these mappings are both one-to-one; the 
critical observation in this regard is that Definition 4.7 is constructed so that i?(e, i) 
is not equal to t"' for any e, i and j. Finally, we observe a correspondence also at 
the level of the rewriting: 

Theorem 4.8. Let u and v he suspension expressions such that ut>„ni^ (u > 
rmffs'f)- U'>^ '^'^'^ ^ 'i'^e terms, then there exists a Xa-term w such that S{u) and 
S{v) >* w (respectively, S{u) >j^^ w and S{v) [>^^w). Ifu and v are environments, 
then for any i such that lev{u) < i, there is a Xa -substitution w such that R{u, i) \>* 

w and R{v, i) t>* w (respectively R{u, i) Oj^^ w and R{v, i) t>^^ w). 

Proof. Applications of the rules (/3s), (r5), (r6), (ml) and (m3) on suspension 
expressions map directly onto applications of (Beta), (App), (Abs), (Clos) and 
(IdL), respectively, on their translations. Rule (r2) that corresponds to renumbering 
a de Bruijn index translates into a sequence of uses of the (Map) and (Ass) rules 
in accordance with the representation of abstracted variables in the Au-calculus. 
Rule (r3) is similar to the rule (VarCons). However, the translation of the Icfthand 
side must be "prepared" for the use of (VarCons) by a sequence of applications of 
(Map) and a peculiarity of the translation of the righthand side may require (IdL) 
to be used on it to produce a common form. In a similar sense, the rules (r4), (m4) 
and (m5) correspond to a "compiled form" of (ShiftCons) and (m6) corresponds 
to a compiled form of (Map). Finally, rule (m2) is similar to the use of (Ass) in 
producing a normal form. □ 
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4.2.3 Translating Xa- expressions into suspension expressions. Going in the re- 
verse direction needs a decision on the range of the mapping for Aa-substitutions. 
Considering a term of the form a[s] indicates what this might be. Such a term 
should translate into a suspension of the form |i, ol, nl, e] where the triple {ol, nl, e) 
is obtained by "interpreting" s. In the case when every composition in s has a shift 
as its right operand, this triple can be arrived at in a natural way: e should reflect 
the substitution terms in s, should be the number of such terms and nl, which 
counts the number of enclosing abstractions, should correspond to the length of 
the longest sequence of compositions with shifts at the top level in ,s. The intu- 
ition underlying the encoding of general substitution composition in the suspension 
calculus now allows this translation to be extended to arbitrary Acr-substitutions. 

Definition 4.9. The mapping T from Atr-terms to suspension terms and the map- 
ping E from Acr-substitutions to triples of an old embedding level, a new embedding 
level, and a suspension environment are defined simultaneously by recursion as fol- 
lows: 

(1) r(l) = #1, T{a b) = {T(a) T(b)), T{Xa) = XT{a) and T{a[s]) is #(n + 1) 
if a is 1 and s is t" for n > and is |T(a), o/, nZ, e| where E{s) = {ol,nl,e) 
otherwise. 

(2) E{id) = (0,0, nil), E^^) = {0,1, nil), E{a ■ s) = {ol + l,nl,{T{a),nl) :: e) 

where E{s) = {ol,nl,e), and E{si o S2) is {oli,nli + l,ei) if S2 is f and is 
{oil + {0I2 — nli),nl2 + {nil — 0I2), |^ei, n^i, 0I2, 62 J) otherwise, assuming that 
E{si) = {oli,nli,ei) and E{s2) = (0Z2, 62). 

It is easily seen that, for any term a of the Acr-calculus, T{a) is a well-formed 
suspension term. The translation treats a term of the form l[t"] as a special 
case, reflecting its interpretation as the encoding of an abstracted variable. If this 
case were not singled out, the translation would produce the term |#l,0,nZ,m/J 
instead. This term can be rewritten to + 1) by the rule (r2). A similar 
observation applies to the translation of s o | . This case is treated as a special one 
to account for the manner in which a substitution is moved under an abstraction in 
the Acr-calculus. If this issue were to be ignored, this substitution would translate to 
{ol, nl + 1, ^e, nl, 0, nilj) instead of {ol, nl + 1, e), assuming that E{s) = {ol, nl, e). 
The environment component of the former triple rewrites to that of the latter by 
the rule (m2). 

The following theorem, whose proof is trivial, is evidence of the naturalness of 
our translations: 

Theorem 4.10. For every suspension term t, T{S{t)) = t. 

In order to state a correspondence between the rewrite systems, we need to extend 
the reduction relations on suspension expressions to triples of the form {ol, nl, e) 
that are the targets of the mapping E. We do this in the obvious way: a triple 
{ol, nl, e) is related to {ol, nl, e') by a rewriting relation just in case e is related to 
e' by that relation. 

Theorem 4.11. If a and b are Xa-terms such that af>^b (a\>\a-b), then there is a 
suspension-term u such that T{a)\>*^u (T{a)>*^0^u) andT{b)'>*^u (T{b)>*^0^u). 
If s andt are Xa- substitutions such thatst>at (s>\at) then there exist environments 
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ei and €2 such that E{s)l>*^{ol,nl,ei) (E{s) l>*^i^^{ol,nl,ei)), E{t)o*^{ol,nl, 62) 
(E{t)>*^{ol,nl, 62)) and ei ~ 62- 

Proof. The argument is by induction on the structure of Acr-expressions. The- 
orem 3.15 permits us to focus on the situation where rewriting takes place at the 
root of the expression. Also, the observations about the "redundancy" of the special 
cases in the definitions of T and E allow us to ignore them in the proof. 

Now, we can observe a relationship between several of the rules in the Acr-calculus 
and rules in the suspension calculus: (Beta) corresponds to (/3s), (App) to (r5), 
(Abs) to (r6), (Varld) to (a special case of) (r2), (VarCons) to (r3), (Clos) to 
(ml), (IdL) to (m3), (Shiftid) to (m2) and (ShiftCons) to (m4). In some cases 
the correspondence is precise in that the translation of the Icfthand side rewrites 
exactly to the translation of the righthand side by the indicated rule. However, in 
most cases, some "adjustments" using other reading and merging rules are needed 
before or after the specific rule application to account for the peculiarities of the 
different calculi. 

The two rules that remain are (Map) and (Ass). The former corresponds to (m6) 
but the correspondence is not quite the same as with the other rules. Suppose 
(a • si) o S2 rewrites to a[s2] ■ {si o S2) by this rule. Let T{a) = t, E{si) = 
{oil, nil, ei), a.ndE{s2) = (0^2, ^^^2, 62). The index components of £J((a-Si)os2) and 
E{a[s2] ■ (si 0S2)) are quickly seen to be identical. The environment components are 
{^{t,nli) :: ei,nli, 0^,62} and (|i, 0^2, nl2, 62], nl2 + {nh -0I2)) :: |ei, n/i, 0^2, 62 J, 
respectively. These are like the left and right sides of rule (m6) with two differences. 
First, 62 might not have the form {s,l) :: e'2 that is needed by rule (m6). This can 
be "fixed" by rewriting 62 at the outset to such a form^. The second difference 
is that the index of the first environment term on the right side uses nh where 
rule (m6) uses I. However, this is not a problem because the two environments are 
claimed only to be similar, not identical. 

Finally, turning to (Ass), we see that there is no rule in the suspension calculus 
that "simulates" it. Rather, this rule corresponds to a meta property of the calculus 
that was proved in Lemma 3.10. □ 

4.2.4 Meta Variables and Preservation of Strong Norraalizahility. Our presen- 
tation of the Acr-calculus is true to its original description in [Abadi et al. 1991]. 
This rewrite system is not confiuent when the syntax of terms is extended to in- 
clude graftablc meta variables. However, straightforward additions to the rule set 
suffice to regain this property [Curien et al. 1996]; see also [Dowek et al. 2000] for 
a system closer in form to the one discussed in this paper. 

The Acr-calculus docs not preserve strong normalizability as we have already 
noted, although the substitution subsystem cr is strongly normalizing. The crux 
of the problem is that the (Beta) rule and the substitution rules can interact with 
each other to get a substitution to scope over its own subcomponents. To sec how 
this might happen, consider the following reduction sequence adapted from [Mellies 
1995]: 

((A a') b')[{{\a) b)-id\ 



^For completeness, the case where 62 reduces to nil must also be discussed. (Map) in this case is 
related to (r2) and the argument is easier. 
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>; (A(a'[l-((((Aa) 6) • zd) o j)])) b'[{{Xa) b) ■ id] 

>Beta a'[l ■ ((((A a) b) ■ id) o T)][6'[((Aa) 6) ■ id] • id] 

>; a'[b'[{{Xa) 6) • id] ■ ((((A a) b) ■ id) o (t o (6'[((Ao) b)-id\- id)))] 

The substitution (f o (&'[((Aa) 6) • id] ■ id)) that appears as a subexpression of the 
last term in this sequence would be rewritten to id in a sensible progression to a 
normal form. However, it can also perversely be distributed over the preceding 
substitution using (Map) to produce the substitution subexpression 

((Aa) 6)[t o (6'[((Aa) b) ■ id] ■ id)] ■ (ido(t o (6'[((Aa) b) ■ id] ■ id))). 

Observe here that [((A a) b) ■ id] has become a subpart of a substitution that stands 
over the term ((A a) b) that originates from itself. 

The preservation of strong normalizability is still an unsettled question with re- 
gard to the suspension calculus. However, Mellies' counterexample does not apply 
to this calculus because the kind of problem situation depicted above cannot be 
created within it. In particular, rule (m6) that corresponds to (Map) in the sus- 
pension calculus ensures that only relevant portions of an external environment are 
distributed over substitution terms. 

5. CONCLUSION 

This paper has presented a simplified and rationalized version of the suspension 
calculus. The new notation has several pleasing theoretical and practical properties 
some of which have been manifest here. This version also differs from the original 
presentation in that it preserves contextual information. This characteristic has 
been central to our ability to describe translations to the Acr-calculus and has also 
been exploited elsewhere in defining a system for type assignment [Gacek 2006b]. 
This paper has also surveyed the world of explicit substitution calculi. It has 
attempted to do this in a top-down fashion, first elucidating properties that are 
important for such calculi to possess and then using these to categorize and to 
explain the motivations for the different proposed systems. In the process we have 
also distilled a better understanding of the capabilities of the suspension calculus. 

This work can be extended in several ways. We mention two that we think are 
especially important. First, like the Acr-calculus, the notation we have described 
here provides the basis for incorporating new treatments of higher-order unification 
that exploit graftable mcta variables into practical systems. It is of interest to 
actually explicate such a treatment and to evaluate its benefits empirically. Second, 
the question of preservation of strong normalizability is still an open one for this 
calculus. This issue appears to be a non-trivial one to settle and an answer to it is 
likely to provide significant insights into the structure of the suspension calculus. 
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